Deep Robust Kalman Filter
نویسندگان
چکیده
A Robust Markov Decision Process (RMDP) is a sequential decision making model that accounts for uncertainty in the parameters of dynamic systems. This uncertainty introduces difficulties in learning an optimal policy, especially for environments with large state spaces. We propose two algorithms, RTD-DQN and Deep-RoK, for solving large-scale RMDPs using nonlinear approximation schemes such as deep neural networks. The RTD-DQN algorithm incorporates the robust Bellman temporal difference error into a robust loss function, yielding robust policies for the agent. The Deep-RoK algorithm is a robust Bayesian method, based on the Extended Kalman Filter (EKF), that accounts for both the uncertainty in the weights of the approximated value function and the uncertainty in the transition probabilities, improving the robustness of the agent. We provide theoretical results for our approach and test the proposed algorithms on a continuous state domain.
منابع مشابه
Robust Tracking Control of Satellite Attitude Using New EKF for Large Rotational Maneuvers
Control of a class of uncertain nonlinear systems, which estimates unavailable state variables, is considered. A new approach for robust tracking control problem of satellite for large rotational maneuvers is presented in this paper. The features of this approach include a strong algorithm to estimate attitude, based on discrete extended Kalman filter combined with a continuous extended Kalman ...
متن کاملRobust Tracking Control of Satellite Attitude Using New EKF for Large Rotational Maneuvers
Control of a class of uncertain nonlinear systems, which estimates unavailable state variables, is considered. A new approach for robust tracking control problem of satellite for large rotational maneuvers is presented in this paper. The features of this approach include a strong algorithm to estimate attitude, based on discrete extended Kalman filter combined with a continuous extended Kalman ...
متن کاملDesign of Instrumentation Sensor Networks for Non-Linear Dynamic Processes Using Extended Kalman Filter
This paper presents a methodology for design of instrumentation sensor networks in non-linear chemical plants. The method utilizes a robust extended Kalman filter approach to provide an efficient dynamic data reconciliation. A weighted objective function has been introduced to enable the designer to incorporate each individual process variable with its own operational importance. To enhance...
متن کاملSensorless Speed Control of Double Star Induction Machine With Five Level DTC Exploiting Neural Network and Extended Kalman Filter
This article presents a sensorless five level DTC control based on neural networks using Extended Kalman Filter (EKF) applied to Double Star Induction Machine (DSIM). The application of the DTC control brings a very interesting solution to the problems of robustness and dynamics. However, this control has some drawbacks such as the uncontrolled of the switching frequency and the strong ripple t...
متن کاملEstimation of LOS Rates for Target Tracking Problems using EKF and UKF Algorithms- a Comparative Study
One of the most important problem in target tracking is Line Of Sight (LOS) rate estimation for using from PN (proportional navigation) guidance law. This paper deals on estimation of position and LOS rates of target with respect to the pursuer from available noisy RF seeker and tracker measurements. Due to many important for exact estimation on tracking problems must target position and Line O...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1703.02310 شماره
صفحات -
تاریخ انتشار 2017